Exploiting Keyword Structure for Domain-Specific Retrieval
Abstract
Structured elements, such as manually assigned keywords or key-phrases in scientific collections, are pervasive in digital libraries. Special dictionaries or thesauri for this meta-information are not always available. Our strategy is to compute the similarity of keywords based on their occurrence in the collection. The resulting keyword space is brought to bear on a variety of tasks. Combined with an information retrieval system, the keyword space lets us recover keywords for queries and thus provides a technique that can be used for automatic classification. Moreover, it can be used to rerank retrieved documents, leading to a significant improvement of retrieval effectiveness in domain-specific collections. The experimental evaluation uses the German GIRT and French Amaryllis collections and the test suite of the Cross-Language Evaluation Forum (CLEF).
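The idea of a keyword space built from occurrence statistics can be illustrated with a small sketch. The abstract does not state which similarity measure or reranking formula the paper uses, so everything below is an assumption made for illustration: cosine similarity over binary document-incidence vectors, keyword recovery by counting keywords among top-ranked documents, and reranking by mean keyword similarity. The function names and the toy collection are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def keyword_vectors(doc_keywords):
    """Map each keyword to the set of document ids it is assigned to."""
    occ = defaultdict(set)
    for doc_id, keywords in doc_keywords.items():
        for kw in keywords:
            occ[kw].add(doc_id)
    return occ


def similarity(occ, a, b):
    """Cosine similarity of two keywords' binary document-incidence vectors.

    Assumption: the paper's actual measure is not given in the abstract.
    """
    docs_a, docs_b = occ.get(a, set()), occ.get(b, set())
    if not docs_a or not docs_b:
        return 0.0
    return len(docs_a & docs_b) / sqrt(len(docs_a) * len(docs_b))


def recover_keywords(ranked_docs, doc_keywords, top_n=2):
    """Return the keywords most frequent among the top-ranked documents."""
    counts = defaultdict(int)
    for doc_id in ranked_docs:
        for kw in doc_keywords[doc_id]:
            counts[kw] += 1
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(counts, key=lambda kw: (-counts[kw], kw))[:top_n]


def rerank(ranked_docs, doc_keywords, query_keywords, occ):
    """Order documents by mean similarity of their keywords to the query keywords."""
    def score(doc_id):
        pairs = [similarity(occ, k, q)
                 for k in doc_keywords[doc_id]
                 for q in query_keywords]
        return sum(pairs) / len(pairs) if pairs else 0.0
    return sorted(ranked_docs, key=score, reverse=True)


# Hypothetical toy collection: document id -> manually assigned keywords.
DOCS = {
    "d1": ["labour market", "unemployment"],
    "d2": ["labour market", "unemployment", "youth"],
    "d3": ["migration", "youth"],
    "d4": ["migration"],
}

occ = keyword_vectors(DOCS)
query_keywords = recover_keywords(["d1", "d2"], DOCS)
reranked = rerank(["d4", "d3", "d2", "d1"], DOCS, query_keywords, occ)
```

Here `["d1", "d2"]` stands in for the top results an information retrieval system would return for a query. The recovered keywords can serve as a classification of the query, and the same keywords drive the reranking of a longer result list.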
Similar resources
Improving relevance in search through Ontology and Query Expansion
Prof. Pushpak Bhattacharyya, Computer Science and Engineering department. Bachelor of Technology. Improving relevance in search through Ontology and Query Expansion, by Anirudh Vemula. From the inception of the Semantic Web in the late 20th century, ontology has been a major focus to achieve the idea of semantic search. In this work, we will review different approaches that have been employed over the y...
Semiautomatic Image Retrieval Using the High Level Semantic Labels
Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches lead researchers to combine them and to use semi-automatic retrieval with user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provides two kinds of qu...
Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword spotting is a well-known method in document image retrieval. In this method, search in document images is based on a query word image. In this paper, an approach for document image retrieval based on keyword spotting is proposed. In the proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method, is used in this paper to imp...
Exploiting Domain Thesaurus for Medical Record Retrieval
InfoLab at the University of Delaware participated in the TREC 2012 Medical Records Track. This paper explains our method and describes experiment results. One limitation of existing keyword matching based retrieval functions is the problem of vocabulary mismatch. To overcome this limitation, we propose to first map topics and visits to bags of concepts using domain thesaurus, and then model th...
Expanding Queries Using Stems and Symbols
This paper describes the experiments conducted in the ad-hoc retrieval task of the Genomic track at TREC 2004. Different query expansion techniques based on the addition of keyword stems and of genomic product symbols selected by relevance feedback were studied. Stemming was tested using a mutual reinforcement process for building a domain-specific stemmer. Relevance feedback was tested using a...
Publication date: 2002